OCR Xpress for Linux
OCR Xpress for Linux Functionality

OCR Xpress for Linux is an SDK for document recognition applications. It recognizes printed characters in a digital image. The image can come from a scanner, camera, or fax machine, and must be saved in BMP format. Any uncompressed BMP file can be loaded into OCR Xpress for Linux and processed with no pre-processing. Once the image is loaded, the application has three processing choices: generate a searchable PDF file, generate a text file, or create a hierarchically structured data model of the image.

Figure 1 shows how OCR Xpress for Linux is laid out. The API manages three forms of interaction.

Figure 1: OCR Xpress for Linux Model

Figure 2: Organization of text lines, words, and characters in a text block

An application can access all of this structured data through the Results Manager. The “Results Manager and Get Functions” topic in the “How To” section describes in detail how to access and use this data. To construct the Internal Structured Data, the application only has to make two API calls: OCRX_load_file() and OCRX_recognize_to_memory(). The application then uses the OCRX_get_xxxx() functions to query the Internal Structured Data and retrieve data from it.

Once the Internal Structured Data is constructed, it persists until OCRX_free_document_result() is called.
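The call order described above (load, recognize to memory, query, free) can be sketched in C. This is only an illustration of the lifecycle, not the SDK's API: every type and function below with a mock_ prefix is a hypothetical stand-in, the signatures are assumptions, and no real OCR is performed. The actual prototypes for OCRX_load_file(), OCRX_recognize_to_memory(), the OCRX_get_xxxx() functions, and OCRX_free_document_result() should be taken from the SDK header and API reference.

```c
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in types and functions. They are NOT the OCR Xpress
   signatures; they only mimic the call order described in the text. */
typedef struct {
    char path[256];   /* image that was "loaded" */
    int  loaded;
} mock_image;

typedef struct {
    int word_count;   /* pretend recognition output */
} mock_result;

/* Stands in for OCRX_load_file(): remembers which BMP to process. */
static int mock_load_file(mock_image *img, const char *path)
{
    if (img == NULL || path == NULL || strlen(path) >= sizeof img->path)
        return -1;
    strcpy(img->path, path);
    img->loaded = 1;
    return 0;
}

/* Stands in for OCRX_recognize_to_memory(): builds the in-memory result. */
static mock_result *mock_recognize_to_memory(const mock_image *img)
{
    if (img == NULL || !img->loaded)
        return NULL;
    mock_result *res = malloc(sizeof *res);
    if (res != NULL)
        res->word_count = 2;  /* fixed placeholder, no real OCR happens */
    return res;
}

/* Stands in for one of the OCRX_get_xxxx() query functions. */
static int mock_get_word_count(const mock_result *res)
{
    return res != NULL ? res->word_count : -1;
}

/* Stands in for OCRX_free_document_result(): ends the result's lifetime. */
static void mock_free_document_result(mock_result **res)
{
    if (res != NULL) {
        free(*res);
        *res = NULL;
    }
}

/* Runs the full sequence and returns the word count, or -1 on failure. */
int run_lifecycle(const char *bmp_path)
{
    mock_image img = {0};
    if (mock_load_file(&img, bmp_path) != 0)
        return -1;

    mock_result *res = mock_recognize_to_memory(&img);
    if (res == NULL)
        return -1;

    /* The result stays valid across any number of queries... */
    int first = mock_get_word_count(res);
    int second = mock_get_word_count(res);
    assert(first == second);

    /* ...until it is explicitly freed. */
    mock_free_document_result(&res);
    assert(res == NULL);
    return first;
}
```

In this sketch the path is only recorded, never opened, so run_lifecycle("page.bmp") returns the placeholder count. The point is the ordering: the result object is created once, queried as often as needed, and released exactly once.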


©2016. Accusoft Corporation. All Rights Reserved.
